Simultaneous Clustering and Feature Ranking by Competitive Repetition Suppression Learning with Application to Gene Data Analysis
نویسندگان
چکیده
The paper presents feature-wise Competitive Repetition-suppression (CoRe) clustering, a novel unsupervised algorithm that deals with the automatic determination of the unknown cluster number and simultaneous feature ranking. The proposed model addresses the limitations of the original CoRe learning algorithm when dealing with high dimensional data, extending the repetition suppression competition on a feature-wise basis. The effectiveness of the approach is tested on gene expression data from DNA microarrays: the results show that the feature-wise CoRe clustering algorithm is able to detect the known data partitioning in a completely unsupervised fashion. Moreover, it simultaneously develops a gene ranking that is consistent with the state-of-the-art list of gene relevance for the selected benchmark datasets.
منابع مشابه
Feature-wise Competitive Repetition Suppression Learning for Gene Data Clustering and Feature Ranking
The paper extends Competitive Repetition-suppression (CoRe) learning to deal with high dimensional data clustering. We show how CoRe can be applied to the automatic detection of the unknown cluster number and the simultaneous ranking of the features according to learned relevance factors. The effectiveness of the approach is tested on two freely available data sets from gene expression data and...
متن کاملA Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملSupervised Clustering of Label Ranking Data
In this paper we study supervised clustering in the context of label ranking data. Segmentation of such complex data has many potential real-world applications. For example, in target marketing, the goal is to cluster customers in the feature space by taking into consideration the assigned, potentially incomplete product preferences, such that the preferences of instances within a cluster are m...
متن کاملMOSCFRA: A Multi-objective Genetic Approach for Simultaneous Clustering and Gene Ranking
Microarray experiments generate a large amount of data which is used to discover the genetic background of diseases and to know the gene characteristics. Clustering the tissue samples is an important tool for partitioning the dataset according to co-expression patterns. This clustering task is even more difficult when we try to find the rank of each gene (Gene Ranking) according to their abilit...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007